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1. Introduction 

In the present work we introduce and study a novel stocliastic growtli process, tlie 
Record-driven (RD) growth process, which originates from the analysis of a class of 
growing networks in a universal limiting regime. As we shall see, this process is also 
related to three other fields, namely the statistics of the cycles of random permutations, 
the statistical properties of randomly broken intervals, and the Kesten variable. 

The RD growth process originates from a class of growing networks with a 
preferential attachment rule, amongst which the most well known representative is the 
model of Barabasi and Albert (BA) [T]. The latter provides a natural explanation 
for the main features observed in real networks [21 El lU [5], and chiefly their scale- 
freeness, testified by the power-law fall-off of their degree distribution. Relevant for 
our purpose are networks where the attachment rule involves both the degree of the 
nodes and their intrinsic quality or fitness, an extension of the original BA model which 
is due to Bianconi and Barabasi (BB) [6J. The BB model has the remarkable feature 
that it exhibits a continuous condensation transition, analogous to the Bose-Einstein 
condensation. 

A general definition of this class of networks is as follows. The network being 
initially empty, at each integer time step, n = 1,2,..., a new node, labeled by its birth 
date n, is added. Node n is endowed once and for all with an intrinsic quality ?7„, modeled 
by a quenched random variable drawn from some given distribution. This node (except 
for the first one) connects by one link to any of the earlier nodes (z = l,...,n— 1) with 
probability 

P,,„ = (1.1) 

where h{ki) is a function of the degree ki of node i, i.e., the number of nodes already 
connected to node i at time n — 1. The denominator 

n-l 

Zr,.i = Y.r]^h{h) (1.2) 

1=1 

is a normalization factor. The probability Pi^n is thus proportional to an intrinsic factor, 
the quality rji of node i, and to a dynamical one, represented by h{ki). Preferential 
attachment to nodes whose degree is already larger is realized whenever the function 
h{ki) is an increasing function of the degree k^. Finally, temperature T is introduced 
into the model by considering the qualities as activated variables, i.e., by setting 

r]n = exp(-£:„/T), (1.3) 

and assuming that the activation energies En are drawn from a given temperature- 
independent continuous distribution. 

The BA and BB models in their original forms both correspond to the linear 
function h{k) = k. The BA model corresponds to the limiting case of infinite 
temperature. 
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After time step n is completed the network consists of n nodes connected by n — 1 
links. It therefore has the topology of a tree. In the rest of the paper we use a slightly 
modified definition of the degree, keeping abusively the same notation, fcj. We define 
the degree of a given node as the number of incoming links on this node, taking aside 
its unique outgoing link, except for the first node, which has no outgoing link. This new 
definition corresponds to shifting by one unit the former quantity, ki —>■ ki — 1, except 
for the first node. The degrees thus defined sum up to the number of links, i.e., 

n 

J2h = n-1. (1.4) 

i=l 

As we now explain, the RD growth process is the zero-temperature limit of the 
class of models described above. In this limit, at any time n the node i with the lowest 
energy: 

= min(ei, . . . (1.5) 

i.e., with the highest quality, has an attachment probability pi^n which is overwhelmingly 
larger than all the other ones, since the ratio Pi^n/Pj,n grows exponentially at low 
temperature, as exp{{ej — e.i)/T). Every new node n therefore connects to the earlier 
node i with best quality at time n, given by (11. 5p . The successive best qualities are 
known as records [3 [8], [9], |10] . The corresponding nodes, that we term record nodes, 
are therefore the only ones whose degree grows. This process is universal in a very 
strong sense. It is independent of the function h{k) and of the distribution of the node 
energies, provided the latter is continuous, so that the event Si = Sj with i ^ j has zero 
probability. In particular, the RD growth process is independent of whether the model 
has a condensation transition or not. 

Let us summarize the definition of the process and give a pictorial interpretation 
of it. Nodes (or individuals) arrive one by one to form a network of relationships. The 
network being initially empty, at each integer time step, = 1, 2, . . ., a new node, labeled 
by its birth date n, is added, and endowed once and for all with an intrinsic quality rjn- 
This node connects by one link to the earlier node with best quality. This directed link 
can be pictorially described as "being a disciple of". Thus groups are formed around 
record nodes (or pictorially "masters"). The size of a group, or the degree of a record 
node, is the number of incoming links to it (i.e., the number of disciples). Times at 
which a node appears with a quality that breaks the previous record are record times. 
Finally, there is another, yet simpler description of the process relying on records only: 
the record times are the dates of birth of the record nodes; the degree of the newly born 
record node grows linearly in time, then stops growing when the next record node is 
born. 

In the present work the main emphasis will be put on the interplay between records 
and leaders. While a node is a record if its quality (an intrinsic property) is better 
than those of all the earlier nodes, a node will be said to be a leader at a given time if 
its degree (a dynamical, time-dependent quantity) is larger than those of all the other 
nodes. Investigating the interplay between records and leaders is natural in the low 
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temperature regime of the BB model because the question is whether the best fitted 
node will or will not be the leader in the course of time. The statistics of leaders and 
of lead changes has been addressed previously for the BA and related models in [TTj . 
In the present case these subtle questions can be analytically approached and given a 
comprehensive quantitative answer. 

The main outcomes of this paper are the following. We first determine the 
invariant measure associated with the dynamical system fl3.17p . This measure gives the 
asymptotic distribution of the fraction of nodes connected to a leader, or alternatively 
that of the longest inter-record interval. The knowledge of the invariant measure 
allows the computation of the probability for a node to be a leader, or equivalently 
the probability of a record breaking for the sequence of inter-record intervals. We 
find that this probability is equal to the Golomb-Dickman constant u = 0.624 329 . . ., 
eq. (14.281) . We then perform the analysis of the statistics of the difference of the labels of 
two successive leaders (equivalently, of two successive records for the sequence of inter- 
record intervals), as well as a thorough study of the statistics of the lengths of time 
associated with the reign of a leader. We finally explain the connections of this process 
with the statistics of the cycles of random permutations, the statistical properties of 
randomly broken intervals, and the Kesten variable. 

The bulk of the paper begins at section [31 In the next section we first establish 
the needed background knowledge on records, since the latter play a fundamental role 
in the definition of the process under study. 

2. Statistics of records 

The discrete theory of records is classical [3, [HI [9l [10] . The continuum theory, which is 
instrumental in our work, is less documented, as is its relationship to a renewal process. 

2.1. Discrete theory 

Given a sequence of numbers, gi, g2, • • •, the value qi is a record if it is larger than all 
previous ones: 



These numbers are for example the successive observations of a random signal, or 
the successive drawings of a random variable, modeled as a sequence of independent 
identically distributed (i.i.d.) random variables. In order to avoid ambiguities due to 
ties, the distribution of the latter is taken continuous. In the present context, the 
stand for the qualities rji of the nodes. 

Referring to the label z = 1, 2, ... as time, the time i of the occurrence of a record 
is a record time. The definition of a record involving only inequalities between the 
variables qi, the statistics of record times is independent of the underlying distribution 
of these variables. 
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The first value qi is always a record. The occurrence of a record at any subsequent 
time i >2 has probability 

Prob(gi > max(gi, . . . , = -. (2.2) 

This holds independently of the occurrence of any other record, either at earlier times 
(j < i) or at later times (j > i). Indeed, qi is a record with probability 1. The 
probability that q2 be a record, i.e., be larger than gi, is obviously equal to 1/2. The 
probability that ^3 be a record is equal to 1/3 because it can be smaller, intermediate 
or larger than the two previous values qi and q2, with equal probability, and so on. 
Eq. fl2.2l) can alternatively be recovered by noticing that, amongst the i\ permutations 
of gi, . . . , gj, there are {i — 1)! permutations where qi is the largest. 

Eq. (12. 2p means that the rate of record breaking is equal to or otherwise stated 
that the occurrence of a record is a Bernoulli process with success probability equal 
to The indicator variables /j, equal to 1 if qi is a record and to otherwise, will be 
the building blocks for the derivation of the results of this section. It is easy to convince 
oneself that they are independent (for a proof, see [3 IH [9l |10] ) . We will first determine 
the distribution of the number of records up to time n, then that of the record times. 

The number M„ of records up to time n is a random variable taking the values 
1, . . . , n, which can be expressed as 

M„ = Ji + /2 + ■ ■ ■ + 4. (2.3) 
Its average and variance read 

n ^ 1 

{M^) = Y^{L^ = Y^-^\nn + j, (2.4) 

i=i 1=1 * 

n " / 1 1 \ TT^ 

varM„ = 5^(1,(1 - /,)) = ^ - - -U. Inn + 7 - — . (2.5) 

i=l 1=1^^^^ O 

where 7 = 0.577215... is Euler's constant. More generally, the generating function 
of Mn reads 

(x^") = f]{x'') = f\(l + ^] = '^(^ + '^)---(^ + ^-'^) _ (2.6) 

The 'rising power' appearing in the right-hand side of (12.61) is known to be the generating 
function of the Stirling numbers of the first kind [12j: 

n 



, r(x + n) " 



m 



x™. (2.7) 



The Stirling number of the first kind ^ is the number of ways of arranging n objects 
in m cycles, or the number of permutations of n objects having m cycles, hence 



m=l 



=< (2.8) 

_m 
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consistently with fl2.7p . setting x = 1. The distribution of M„ is therefore given by 

n 

Prob(M„ = m) = (2.9) 
n! 

In other words, the number M„ of records up to time n is distributed as the number of 
cycles in a random permutation on n elements (see section 16.11 for further developments 
on the connection of records with random permutations). 
The asymptotic form of the Stirling numbers 

m (n - 1)! (lnn)™-i 

^ ^ ^ — , (2.10) 



m 



(m-1)! 

is readily derived from (12.71) by simplifying the gamma functions for large n and small x 
as r(a; + n) ~ (ra — l)\n^ and r(a;) ~ 1/x. As a consequence, the asymptotic form of 
the distribution of Mn reads 

Prob(M„ = m) ^ -) (2.11) 

n [m — ly. 

The bulk of the distribution of M„ is thus, up to a shift by one unit, asymptotically given 
by a Poissonian law of parameter In n. This holds only to leading order in In ra. The exact 
large-n behavior of the mean and variance of M„, eqs. (12. 4p and (12.51) . differs by additive 
constants from the predictions of ( 12. lip . (M„)poisson = Inn + 1, (var M„)poisson = Inn. 

Let us denote by Ni, N2 . . . , N^, ■ ■ ■ the successive record times. Thus Ni = 1, 
since qi is always a record. Then, if is the next record after qi, the second record 
time is N2 = 12, etc. The distribution of the m-th record time Nm follows from (12. 9p : 

" k-l ' 

Froh{Nra = k) = (A; = m,m + 1,...), (2.12) 

and m > 2. We have indeed the equivalence of events 

{Nm = k} = {Mk-1 = m-l,Mk = m} = {M^^i = m - 1, 4 = 1}, (2.13) 

where the two events appearing inside the rightmost brackets are independent. Using 
the asymptotic form (I2.10p one gets 

P-'^W" = *'-pW^ (2.14) 

for large values of k and m. 

The sequence Ni,N2 . . . , Nm, ... of record times can be generated recursively. We 

have 

Prob(iV„ = k\Nm-i = j) = Pk\j = J^j^y (2-15) 

independently of m, i.e., independently of the occurrence of earlier records. Indeed, 

Pk\i = Prob(Jj+i = ■ ■ ■ = = 0, Jfc = 1) 

= Prob(/j+i = 0) . . . Prob(/fc_i = 0) Prob(/fc = 1) 
J J + 1 k-2l 



j + 1 j + 2 k-l k'' 



(2.16) 
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because of the statistical independence of the elementary events. The product of 
fractions simplifies to fl2.15p . Consequently, 



or else 



Prob(iV^ > k\Nm-i = j) = J2pi\3 = jr^, (2.17) 

i>k ^ 

Prob(iV„ > A;|iV^_i) = (2.18) 

This expression can be recast in the form of the random recursion 

iV^ = l + Int-^, (2.19) 

where Int x denotes the integer part of the real number x, and Um is a uniform random 
variable between and 1, independent of A^m-i- 

We now look at the distribution of record times from a different viewpoint. For a 
fixed instant of time n > 1, we consider the time Nm of occurrence of the last record 
before n (n being included), that is the M„-th recorcQ. The time Nm is a discrete 
random variable uniformly distributed between 1 and n: 

Prob(A^M = k) = - (l<k<n). (2.20) 
n 

Indeed, the joint distribution of Nm and M„ reads 

Prob(A^Af = k,Mn = m)= Prob(4+i = . . . = J„ = 0) Prob(A/'„ = k) 

k 

= - FToh(N^ = k). (2.21) 
n 

for m < k < n. Using f l2.12p . (12.81) and summing (12.211) over m yields the result. 
Eq. (12.201) can be alternatively recovered by noticing that the time Nm is entirely 
characterized by the property that the signal qi takes its maximal value over i = 1, . . . ,n 
at time i = Nm- These n possible values of Nm are clearly equally probable. 

Similarly, consider the time A^a/+i of occurrence of the (M„ + 1)— st record, i.e., of 
the first record after a given time n. It is easily found, by the same reasoning leading 
to fl2J[5|) . that 

Prob(iVM+i = fc) = ^^^-^ {k>n + l). (2.22) 

Therefore 

from which one can infer that, when n oo, u/Nm+i is uniformly distributed between 
and 1. This will be proved below, by another method. 

I In the following we drop the subscript when it is attached to a quantity which is itself in a subscript, 
whenever there is no ambiguity. 
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2.2. Continuum theory 

At large times, the discrete theory of records has an asymptotically exact description 
in terms of continuous variables. This description stems from the recursion, inherited 
from (ICTD . 

Nm = (2.24) 

where the Nm are now considered as real numbers. The product structure of this 
recursion suggests to introduce a logarithmic scale of time. In this time scale the 
process of record times is transformed into a very simple renewal process [131 El US] , as 
shown below. This remark allows an easy access to the determination of the asymptotic 
distributions of the observables needed in the sequel (M„, Nm, Nm)- 
We set@ 

t = \nn, tm = \nNm+i, r„ = -lnf/^+i. (2.25) 

The Um being uniformly distributed between and 1, the increments are i.i.d. random 
variables with density p(r) = e""^. The recursion fl2.24p becomes 

tm = tm-l + Tm, (2.26) 

which defines the time of occurrence of the m-th renewal, 

tm = Ti + --- + Tm. (2.27) 

Its probability density ft^ is therefore equal to the m-th convolution product of p, 
corresponding to the gamma distribution: 

T^m— 1 

ft^{T)=e-^- -, (2.28) 

[m — 1)\ 

from which we formally deduce the density of A^m+i = exp(tm) 

in agreement with the asymptotic form (12.14^ of the law of Nm- This is the law of the 
inverse of the product of m uniform random variables between and 1. 

The number of records up to time n, M„, translates into the number Mt of renewals 
up to time t, up to a shift by one unit between the two quantities, 

M„ = M + 1. (2.30) 

In the present case of exponentially distributed increments, the distribution of Mt is 
Poissonian: 

Prob(M = m) = e"*— , (2.31) 
m! 

in agreement with the asymptotic form (12.111) of the law of M„. 

§ The shift of the index m by one unit is due to the fact that there is a record at A^i = 1, while the 
usual convention <o = holds in renewal theory. 
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Consider finally the ratios 



Nn = = exp{-Bt), Yn = = exp{Et), 

n 



Nm _ 






exp(- 


n 




Nm+1 




Nm 





Zn = = ^ = eMBt + E,), (2.32) 

where the backward and forward recurrence times are respectively defined as 

Bt = t- tM, Et = tM+i - t. (2.33) 

For a renewal process with exponentially distributed increments, the latter quantities 
are statistically independent of each other and have the same exponential distribution 
as the increments |16], i.e., 



/B,(S)=e-^, /^,(E)=e-^. (2.34) 

We can thus conclude that X„ is uniform between and 1, consistently with fl2.20p . 
that Yn is equal to the inverse of such a uniform random variable, consistently with 
what was suggested by (12.231) . and that Z„ is therefore equal to the inverse of a product 
of two such uniform random variables, namely 

fx{x) = i (o<x<i), fY{y) = -^ (y>i), 

y 

fz{z) = ^^ (z>l). (2.35) 
3. The Record-driven growth process 

The definition of the process given in the Introduction can now be made more 
quantitative, first at the level of the discrete formalism, then in the continuum limit. 

3.1. Discrete description 

Let kiin) denote the degree of node i (defined as the number of incoming links on this 
node) at the later time n > i. We have clearly ki{i) = 0. Then: 

• If rji is not a record, i.e., if the time i is not a record time, the degree of node i does 
not grow any further: 

ki{n) = for all n>i. (3.1) 

• If rji is a record, i.e., if the time i = Nm belongs to the sequence of record times, 
the degree of node i grows linearly with time until the subsequent record is born at 
time Nm+i- It then stays constant. Setting 

I = Nm, h{n) = Km{n), (3.2) 

one has therefore for the degree of the m-th record node 

^ . ._fn-Nm iNm<n<Nm+i), , . 
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The limit value of Km{n), that is Kfn{Nm+i) , is simply given by the inter-record interval, 
which we denote by 

\n+l = Nm+1 - Nm. (3.4) 

Figure [1] illustrates these definitions. 



K (n) 
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Figure 1. Schematic plot of the time dependence of the degrees K,n{n) of the first 
few record nodes (m = 1, ... ,4) in the RD growth process. The thick line shows the 
part of its history where the second record node (m = 2) is the leader. 



At any given instant of time n, there is a node whose degree is larger than that of 
all existing nodes at this time. We term this node the leader at time n. We will also 
say that a given node is a leader if it leads for some period of time during its history. 
For instance, on Figure 1, the record nodes 1, 2 and 4 are leaders, but 3 is not. If m 
denotes the number of record nodes before time n, i.e., < n < Nm+i, the degree of 
the leader at time n reads 

L(n) = mayi{Ki{n), K2{n),..., K^{n)) 
= max(A2, . . . , A^, n - N^) 
= max{Lm,n- N^), 

where 

= max(A2, . . . , A^) (3.6) 

is the degree of the leader, L{Nm), at the record time A^^ {m > 2). 

For the time being we focus our attention onto the degree of the leader at record 
times. We will return to the case of a generic time in section 14.31 Eq. (13.51) yields the 
fundamental recursion relation 

Lyn+i = max{Lm,Am+i). (3.7) 

The meaning of this recursion is as follows. 

• If A m+i > Lm, the m-th record node is the leader at time A^m+i- One has therefore 

Lm+l = ^rn+l = -^^(A^m+l)- 

• If Am+i < Lm, the m-th record node is not the leader at time Nm+i- The degree of 
the leader is left unchanged, i.e., L^+i = L^- It turns out that the m-th record node 
will never be a leader (see fl3.15p ). This node is said for short to be a subleader. 



(3.5) 
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The condition A^+i > is equivalent to saying that A^+i is larger than all the 
previous for k = 2, . . . , m, i.e., is a record for the sequence of inter-record times 
A2, A3, . . . Leaders therefore correspond to records for this sequence. On the example 
of Figure [1] we have L2 = A2 < -L3 = -L4 = A3 < L5 = A5, but A3 > A4. Thus the 
record nodes 1, 2, and 4 are leaders, and A2, A3 and A5 are records. 

Let 

Um = Prob(A„+i > Lm) (3.8) 

denote the probability for the m-th record node be a leader, or as mentioned above, the 
probability of a record breaking at step m + 1 for the sequence of inter-record intervals 
A2, A3, ... In section m we will determine the limit u of Um when m — >■ 00, that is, the 
probability for a record node to be a leader, in the limit of long times. At this stage we 
can give the expression of this quantity in the first two steps of the process. One has 
clearly ui = 1. Furthermore it is easy to see that the joint distribution of the record 
times A'^2 and N3 reads 

Prob(iV2 = J, N3 = k)= p,\ipk\j = ik>j> 1). (3.9) 

For N2 = j and A^3 = k, we have Li = 0, L2 = A2 = j — 1, and A3 = A; — j. The first 
record node, born at time A^^i = 1, is always a leader. The second record node, born at 
time N2 = j, is a leader if A3 > L2, i.e., k > 2j. This occurs with probability 

^ § £ {j-m-i)k = § (j-i)(2j-i) 

= 2(1 -In 2) = 0.613 705... (3.10) 

The expressions of Um become increasingly complex as m becomes larger and larger. 
Let us mention the following result without proof: 

1\ Ttt^ nVS , „,2 31n3 „ , 7 



a;3 = Li2 ^ + 3(ln2)^ 21n2 

V4/ 36 12 ^ ^ 4 z 

= 0.626 218..., (3.11) 

where the first term involves the dilogarithm function. 

We performed an accurate numerical evaluation of the Um by a simulation of the 
process based on a generation of records using eqs. fl2.19p . fl3.4p and fl3.7p . Figure [2] 
demonstrates the very fast convergence of the Um to the limit u, known as the Golomb- 
Dickman constant (see f l4.28p ). In contrast with the case of i.i.d. random variables, the 
record breaking rate of the sequence of A^ goes to a non-trivial constant. 

Denoting by u = 1,2,... the difference of the labels of any two successive leaders, 
say m and m -\- u, the "reign" of the leader born at Nm, i.e., the period of time during 
which it stays a leader, begins at time 

am = N^ + (3.12) 

and ends at time 

bm = O-m+u = Nm+u + Lm+v (3.13) 
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Figure 2. Plot of the probability ojm that the m-th record node is a leader, against m. 
Full symbols: exact values (|3.10p and (|3.1ip . Empty symbols: data for higher m, 
obtained by a numerical simulation based on the recursions ()2.19p . ()3.4p and ()3.7p . 
Dashed line: asymptotic limit given by the Golomb-Dickman constant w, eq. (I4.28p . 



The reign therefore has a duration 

(^m ^m+u '^m- (3-14) 

The statistics of these times is addressed in section 5. 
Finally, the inequalities 

< Nm+i < bm (3.15) 

prove that the m-th record node is a leader (for some period of time) if and only if it 
leads at time A^m+i, i-e., if and only if A^+i > Lm- 

3.2. Continuum theory 

As stated previously, the statistics of records is faithfully described in the regime of 
long times by a continuum approach. The late stages of the RD growth process have a 
similar asymptotically exact continuum description. 

In the continuum theory the key quantity is the ratio 

= (3.16) 

which is the fraction of nodes which are connected to the leader at that time. Indeed the 
numerator is the degree of the leader at time A^m? while N^n is equal to the number 
of nodes in the system. According to ( 13. 6p . Rm is also the scaled maximal inter- record 
interval. The values taken by Rm are clearly between and 1. Recalling f l2.24p . the 
recursion (13.71) becomes 

Rm+i = max(JJm+iRm, 1 " Um+l) , (3.17) 
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where Um+i is uniformly distributed between and 1 and independent of Rm- The 
branches of fl3.17p correspond respectively to the events 

Crn = {m leader} = lUm+i < , j Rm+i = 1 - 

L i + Km ) 

SSm = {m subleader} = \ Um+i > . , „ \ Rm+i = U^+iRm- 

L i + Km ) 

(3.18) 

The stochastic dynamical system defined by (13.171) plays a central role in the 
following. It is reminiscent of the recursions introduced by Dyson [17], then used 
extensively in the study of one-dimensional disordered systems [T8] . 



4. Statistics of the leader: one-time quantities 

4-1. Invariant distribution 
There is an invariant distribution 

associated with the random recursion (13.171) . The latter implies a recursion between the 
probability densities of Rm and Rm+i'- 

fR^^, = iL + S)fn^, (4.2) 

where we have introduced the linear operators L (leader) and S (subleader), acting on 
a function /(x) defined for < x < 1 as 

/•min(l,x/(l— x)) /■min(l,x/(l— x)) An, 

Lf{x) = / dufiu), Sf{x) = / -f{u). (4.3) 

Jo Jx U 

The invariant distribution therefore obeys the fixed-point equation 

fn = iS + L)fR. (4.4) 

Defining the variable V = 1/R, with density fv{v) for f > 1, eq. (14. 4p can be recast 
as the integral equation 

r r duufv{u) + l (l<w<2), 
v'Mv) = \ ' (4.5) 
/ duufv{u)+ dufviu) {v>2), 

which simplifies to the differential equation 

We readily obtain both the explicit result 

fv{v) = l il<v<2), (4.7) 

and the integral relation 

1 r°° 

fv{v) = - dufv{u) {v>2). (4.8) 

V Jv-l 
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Transforming back to the variable R, we find 

1 /■min(l.a;/(l— x)) 

h{x) = - ^ dufniu). (4.9) 

X Jo 

Using (14.71) as an input, the second hne of (14. 6 p can be solved iteratively. We thus 
obtain more and more complex analytic expressions for the invariant density fv{v) on 
the intervals delimited by the integers. We have /y(f) = (1 — ln(f — l))/v for 2 < t> < 3, 
whereas the expression for 3 < f < 4 involves the dilogarithm function Li2(2 — v). The 
corresponding expressions in terms of the variable R are /i?(-R) = 1/R for 1/2 < R < 1 
and fR{R) = [l + \n{R/{l- R))]/R for 1/3 < i? < 1/2. The invariant density fviv) has 
weaker and weaker singularities at integer values of v. Using (14.61) it is readily found 
that the n-th derivative of fv{v) has a discontinuity at f = n + 1 of the form 

M^v\n + 1) = f^v\n + 1+) - #)(n + 1") = (4.10) 

Figure [3] shows a plot of the invariant density /^(-R), obtained by a direct numerical 
simulation of the recursion (13.171) . This procedure is more accurate than for example 
a numerical inversion of the explicit form (I4.16P of the Laplace transform. The leading 
singularity at i? = 1/2 is clearly visible as the maximum of a symmetric cusp. The 
above expressions for the invariant density indeed yield //j(-R = 1/2 ± 0) = =f4. The 
other singularities at R = 1/3, R = 1/4, etc., are not visible. 




Another consequence of the differential equation (14.61) is as follows. Multiplying 
both sides of these equations by f^, where p is any complex number, and integrating 
over V gives 

/oo 
dvvP+^Mv) = -{{V+lY). (4.11) 



p{W). (4.13) 
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An integration by parts using the initial value /v(l) = 1 coming from 04.71] leads to the 
identity 

((^+in = i+p(n, (4.12) 

which holds for any complex p. It translates into an identity for the invariant distribution 
of the variable R (up to a change of p into its opposite) 

The explicit computation of the invariant distribution is more easily performed by 
introducing the Laplace transform 

/y(.) = (e-^^) = dvfy{v)e-'\ (4.14) 

Eq. (14.61) yields, using again the initial value /y(l) = 1, 

s^^ = e-^Uv{s)-l). (4.15) 
The solution of this equation reads 

fy{s) = l- e-^(^) = 1-8 e^-^(^) , (4.16) 
where we have introduced the functions 

E{s)= / Au = -Ei(-s) = -7-lns + F(s), 

Js U 

and Ei is the exponential integral function. The second expression of the function F{s) 
shows that it is an entire function of the complex variable s. The moments of V are 
readily obtained by expanding the second expression of f l4.16p in powers of s. They are 
rational multiples of e''': 

(V) = e^ (V^) = 2e\ {V^) = etc. (4.18) 

Let us finally determine the behavior of /y(f ) as f — > oo, or equivalently that of 
/i?(x) as X — > 0. This analysis is conveniently done along the lines of [19]. Anticipating 
a fast decay in the regime under consideration, we set 

Mv) r-. e-^^^\ (4.19) 

and approximate the integral in the right-hand side of ( 14. Sp by 

oo Q-<l>{v-l) 

dufviu)^— -, (4.20) 

v-l (p'{V — 1) 

obtaining thus 

^{v) - (f){v - 1) ^ \nv + \n^\v - 1). (4.21) 

Setting 

l = \nv, (f){v) =va{l), b{l) = a{l) + ^^, (4.22) 
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we obtain 

b{l) ^l + lnb{l). (4.23) 

The latter equation is correct up to terms of relative order 1/v, i.e., exponentially small 
in /. It is therefore exact to all orders in l/l. Its solution yields the asymptotic expansion 

a = / + /i + y-^^ + + ■••, (4.24) 

with the shorthand notation /i = In / — 1 = In(lnf) — 1. We thus obtain 

fv{v) ~ exp l^-v (\nv + \n{\nv) - 1 + ^"'^^'^J^ ~ ^ ^ " ') } ' 

The invariant density therefore decays faster than exponentially. It can be said to decay 
factorially, as the leading behavior of (14.251) is identical to that of l/v\. 

4-2. Probability for a record node to be a leader 

The knowledge of the invariant distribution fji allows the determination of one-time 
quantities in the late time regime of the process. In particular, consider the asymptotic 
probability uj for a record node to be a leader (for some period of time): 

u = lim = lim Prob(£m) = lim Prob ft/m+i < • (4.26) 



1 + Rm 



Thus, using the identity fl4.13p for p = 1, 



^ = ( ) = {R)= I dsfvis). (4.27) 



1 + R/ ' ' Jo 
Finally, using (14.161) . we obtain 



UJ 



r dse-'-^^''^ = dx exp ( T-^l = 0.624 329 988... (4.28) 
Jo Jo \Jo my J 



This number is known as the Golomb-Dickman constant [20]. It first appeared in 
the framework of the decomposition of an integer into its prime factors [21], then in 
the study of the longest cycle in a random permutation of order n [22l [23| |2^ . The 
connection of the present study to the statistics of cycles of permutations will be given 
in section 16. 1[ 

Eq. (14.271) shows that u is also the mean scaled maximal inter-record interval. 
4.3. Degree of the leader at a generic instant of time 

The invariant distribution f^ associated with the recursion (I3.17P gives the probability 
distribution of the fraction of nodes which are connected to the leader at a record time, 
in the late-time regime. We now solve the same question when the time of observation 
is a generic late time. 

Let an instant of time n ^ 1 be given. Eq. (13. 5p now reads 

L{n) = max(LM, n - Nm), (4.29) 
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where M = Mn is the fluctuating number of records before time n, and Lm = 
max(A2, . . . , Am)- Setting 

R(^n) = (4.30) 
n 

equation (14.291) leads to 

R{n) = max(i?MX„, 1 - X„), (4.31) 

where the ratio X„, defined in (12.321) . is uniformly distributed between and 1 and 
independent of Rm (see (I2.35P ). The recursion (I4.3ip therefore maps Rm onto R{n) in 
exactly the same way as the recursion (13.171) maps Rm onto Rm+i- As a consequence, 
the distribution of the ratio R{n) at a generic late time is also given by the invariant 
distribution f^. Stated otherwise, the fraction of nodes which are connected to the 
leader at a record time and at a generic time are identically distributedf" 



4.4- Probabilities for the current record node to be a leader 

Consider first the probability that the current record node at time n, that is, the 
record node number M„, is the leader at the current time n. This event requires that 
n — Nm > Lm, which implies 1 — > RmX^, or finally X„ < 1/(1 + Rm)- The 
variable Rm is asymptotically distributed according to the invariant distribution /^j, 
while X„, uniformly distributed between and 1, is independent of Rm- As a 
consequence, we find that the probability under consideration is again equal to the 
Golomb-Dickman constant uj. 

Consider now the probability that the current record node at time n, that is, the 
record node number M„, is a leader (for some period of time). This is a different quantity, 
larger than the previous one. Recall that by "a given node is a leader" we mean that it 
leads for some duration of time. It does not necessarily lead at time n, but will surely 
lead at time Nm+i (see (I3.15P ). Now the inequality to be satisfied is Nm+i — Nm > Lm, 
implying Z„ > 1 + Rm, where the time ratio defined in (12.320 . with probability 
density (12.350 . is independent of Rm- The probability under consideration therefore 
reads 

n= rmMR) r d.!H = /l±Ml+^\ = 0.914063... (4.32) 

JO Jl+R \ 1 + it / 

As expected, we have > cj. This inequality has yet another interpretation. The 
probability for the current record node to be a leader is larger than the probability lo 
for any record node to be a leader. This is so because, for a fixed time n, larger intervals 
Nm+i — Nm have a higher probabihty to be sampled. And the later the successor of a 
record node is born, the higher its probability to be a leader. 

II This property extends in a straightforward way to the full degree statistics (see section [6T2|) . The 
moments Ym'^ of any finite order p, introduced in (|6.5p . are also identically distributed at a record time 
and at a generic time. 
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5. Statistics of the leader: two-time quantities 

This section is devoted to a study of two-time quantities in the process, focusing our 
attention onto the main characteristics of the reign of a leader, defined in section I3.H 
i.e., the difference u of the labels of two successive leaders (section l5.ip . and the time 
ratios Nm+u/Nm, am/Nm, bm/Njn, Im/Njn (scctiou [5121) . All these quantities have limit 
distributions in the late-time regime. 



5.1. Statistics of the difference of the labels of two successive leaders 

Let us first consider the distribution of the random integer v = 1, 2, . . ., which is the 
difference of the labels of two successive leaders in the sequence of record times, m and 
m + u. Hereafter we consider m as being large enough, so that the process is described 
by the continuum formalism. We shift the record labels by m, so that the two leaders 
now have labels and u. We consistently denote the invariant density ffj by /q. The 
probability for a record node to be a leader reads 

Prob(/:o) = ^- (5.1) 
The distribution of z/ is encoded in the probabilities 

Pn = Prob(i/ = n) = — Prob(£o'S'5'i . . . SSn-iCn), 

Q„ = Prob(z/ >n) = - PToh{CoSSi . . . SSn-i). (5.2) 

to 

The following relationships hold: 

Pn Qn Qn ^ ^ Prri) 

m>n 

Y,Pn = Ql = l, {l^) = Y.^Pn = Y.Qn- (5-3) 

n>l n>l n>l 

In order to compute the P„ and Qn we consider the sequence of functions fn{,x) 
defined by 

fn = S^-''Lfo (n>l), (5.4) 

obtained as the result of the action, upon the invariant density /q, of the operator L, 
followed by n — 1 successive actions of the linear operator S. We have 
1 /-i 

Qn = - dufn{u). (5.5) 
UJ Jo 

Similarly, 

Pn=- f'dufniu)-^, (5.6) 

UJ Jo I + u 

the factor 1/(1 + u) being due to the last event £„ in the definition (15.21) . 
We have 

^ /„ = (1 - Sy'Lfo = fo (5.7) 

n>l 
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since (1 - S)fo = Lfo by or (1 - S)-^Lfo = /q. Eq. has a simple 

interpretation: the distribution of the variable R of the next leader is identical to that 
of the variable R of the current leader. Inserting the sum rule (15. 7p into (15. 5p yields, 
using (15.31) . 

(^) = E = (5-8) 

^1 ^ 

This result, too, has a simple interpretation: the mean distance (u) between two 
consecutive leaders along the sequence of record times is the inverse of the probability u 
for a given record to be a leader, or the inverse record breaking rate for the sequence of 
inter-record intervals. 

We now perform the computation of the cumulative probabilities Qn- Let 

/i = Lfo, (5.9) 

i.e., using (14.31) . 

r 1 for 1/2 < X < 1, 

f^(x) = { r^ld-^) , ^ , (5.10) 

dM/o(n) for0<x<l/2, ^ ' 

or finally, using (14. 9p . 

h{x)=xh{x). (5.11) 
The function /i(x) is the initial condition for the recursion 

Ui{x) = SUx) = / —fn{u). (5.12) 

Jx U 

We make the change of variable y = 1/x and define gn{y) = fn{x). These functions 
obey the recursion 

fy dv 

9n+i{y) = / —gn{v), (5.13) 

Jmax(l,y-1) V 

which assumes a simpler form in terms of Laplace transforms: 

Us) = (y3;z7^-+i(^)) • (5-14) 
Consider the generating function of the gn{s), 

G{x,s) = J2g^is)x^-\ (5.15) 

n>2 

From (I5.14P we obtain the differential equation 

x{G{x, s) + g,{s)) = ")) ' ^^-IS) 

whose solution reads 

G{x, s) = x e-"^(^^ / dte"^(*)^i(t). (5.17) 



s 

Furthermore, from (I5.1ip it is easily found that 
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Coming back to the Qn, we have 

Qn = - dufM = - dt;^ = -/ dssUs). (5.19) 

00 Jo UJ Jl UJ Jo 

Introducing the generating function of the Qn, 

S(x) = 5:g„x"-\ (5.20) 



n>l 

we obtain 



1 r°° 

S(x) = l + -/ dssG{x,s). (5.21) 

JO 

The expressions (15.171) and (15.181) yield the exphcit result 

S(a;) = - / dse-^^W / dt e^^+^^-^^^W. (5.22) 

UJ Jo Js 

Several results of interest can be derived from this exact expression. First of all, 
taking the derivative at x = 0, we obtain 



Q2 



S'(0) = - r-{t-l + e-*) e-*-^W. (5.23) 
UJ Jo t 

The integral can be evaluated to yield 

Q2=^-^. i.e., Pi = ^. (5.24) 

UJ UJ 

This can alternatively be obtained using (I5.6P and (15.111) . The probability that two 
successive records are leaders therefore reads Prob(£o'^i) = ^Pi = 1 — uj = 0.375 670 . . . 
This value is close to what it would be in the absence of any correlation, namely 
= 0.389 788... 

The moments of v can be derived by expanding ( I5.22p around x = 1. We thus 
recover the result (15. 8p for (z/), as S(l) = 1/uj. For the second moment (z/^), we find 

{v^)=Y. n'P„, = 5: (2n - 1)Q„ = - + 2S'(1) 

n>l n>l ^ 



1 

UJ 



1 + 2 /'"^e-^W r ^e-*(l-e-*) 

S J s t 



= 3.383 695... (5.25) 

The expression (15.221) also shows that S(x) has no singularity at any (finite) value 
of X. In other words, it is an entire function of the complex variable x. As a consequence, 
the Qn fall off at large n faster than exponentially. This means that Qn+i/Qn 0, so 
that Pn ~ Qn- From a quantitative viewpoint, the fall-off of the Qn (i.e., of the Pn) can 
be derived from the asymptotic behavior of S(x) as Rex ^ +oo. In this regime, the 
double integral in (I5.22p is dominated by small values of s, where F{s) ~ s, and large 
values of t, where F{t) ^ Int + 7. We thus obtain 

p-yx 1 / 27r 

S(x) ^ — r(x) ^ -W — e"(i°"+^-i) (Rex ^+00). (5.26) 
ujx V a; 
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The contour integral representation of Q„, 
J ivri x" 

can then be evaluated by the saddle-point method. Parametrizing the saddle-point as 
Xc = n/a, we are left with the estimate 



P ^ O 



-n(a— 7— 1+1/a) 



(5.28) 



where a obeys the implicit equation 

a = ln(n/a)+7. (5.29) 
Setting A = Inn + 7 and yU = InA = ln(lnn + 7), we obtain the asymptotic expansions 

a = A-. + ^ + ^^<f^ + ... (5.30) 

and 

(5.31) 



Q„ ~ exp <^ -n I In n - - IH h ^ + ■ ■ 



The above expansions are very similar to ( 14.24^ and ( 14.25^ . The Pn can again be said 
to decay factorially, as their leading behavior coincides with that of l/n\. 

The accuracy of the estimate (15.281) is demonstrated in Figure HI showing a 
surprisingly good agreement between this prediction and numerical data for the proba- 
bility distribution obtained by a direct numerical simulation of the recursion fl3.17p . 




Figure 4. Probability distribution of the difference of labels v between two successive 
leaders. The logarithm of P„ = Prob(i^ = n) is plotted against the integer n. Symbols: 
data obtained by a direct numerical simulation of the random recursion (|3.17|) . Line: 
value of the asymptotic estimate (|5.28p . obtained by means of a numerical solution of 
the implicit equation (|5.29|) . 
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5.2. Statistics of the lengths of time associated with the reign of a leader 
We now turn to the statistics of the time ratios 

-^m+u (^m n \ ^rn rj^^N 

Pm — ; Q^m — ' Pm — ~ "/V~' [O-OA) 

-''m -''m -''m -''m 

where A'm and Nm+v are the birth times of the current leader and of the next one, and 
where the beginning and ending times, and 6m, and the duration of the reign 1^ of 
the current leader are defined in (13.121) . fl3.13p and (13.141) . 

We again shift the record labels by m, so that the two leaders under consideration 
have labels and z/, whereas the intermediate record nodes, labeled m = l,...,z/ — 1, 
are not leaders. We have therefore, using (I3.17P 

Ri = l-Ui, 
R2 = {1-U,)U2, 

Ru = {l- Ui)U2 ...U,. (5.33) 

The above time ratios can therefore be expressed in terms of the variables i?o, Ri 
and Ri, as 

ao = 1 + i?o, (5.34) 

Ao = /5o - "0- (5.36) 

These time ratios have well-defined limit distributions /„, /p, //j, f\ in the late-time 
regime. This reflects the temporal self-similarity of the process. 

Figure [5] shows plots of the probability distributions /p, fp, f\ of the three time 
ratios pQ, Pq and Aq, and of the distributions fi/p, /i/a, fi/x of their inverses. The data 
have been obtained by a direct simulation of the recursion (13.171) . The plots emphasize 
the following characteristics. The three distributions fp, fp, fx share with the invariant 
distribution fji the property that their maxima correspond to cusps. The values at 
which these cusps occur, namely Pcusp = 2, /3cusp = 3 and Acusp = 1, are readily obtained 
by replacing in (I5.35p . (I5.36P both variables Ri and R,^ by -Rcusp = 1/2. The lower plot in 
Figure [5] demonstrates that the distributions of the inverse time ratios have well-defined 
limits at zero: 

/i/p(0) = A /i//3(0) = /i/a(0) = R (5.37) 
In other words, the distributions of these three time ratios fall off as l/x"^: 

/p(x)^4' Mx)^fx{x)^^. (5.38) 

The exact values of the amplitudes A and B will be evaluated below (see (I5.49p - (l5.5ip ). 
The fact that fp and fx share the same fall-off amplitude B is simply due to the fact 
that the difference /3o — Aq = ao is bounded in the range 1 < ao < 2 (see (I5.34p ). 
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Q.4i 




Figure 5. Top: plot of the probability distributions /p, /a, f\ of the time ratios 
introduced in (|5.35[) . (|5.36p . Bottom: plot of the probability distributions of their 
inverses. Symbols: exact values A and B of the densities at the origin (see (|5.49[) - 
(|5.5ip ). The dashed lines emphasize the cusps at pcusp — 2, /3cusp = 3 and Acusp = 1- 



The actual evaluation of the joint distribution of the variables Ri and is now 
performed. The starting point is to consider the probability 

P(ri, r) = Prob(/?i <ri,R^<r, Co), 

= Prob(i?i < ri,Rn < r,Co ,u = n), 

n>l 

= Prob(i?i < n, i?„ < r, Co SSi... SS^-i (5.39) 

n>l 

Note that -P(l, 1) = Prob(£o) = ^- The normalized joint density that we are looking 
for is thus 

/«.K.(ri,r) = iA|.p(,^,,). (5.40) 



As in section |57T| we start from the invariant distribution /q. Acting on it with the 
operator L yields /i (see f l5.9p ). We then fix the value of -Ri to ri, thus obtaining the 
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density {1 / u) fi{ri)6 {r — ri) . We then define 

<^iiri,r) = S{r ~ri), (5.41) 
and introduce the functions 

'fn{ri,r) = 5'""Vi(n,'"), 

^in,r) = J2^niri,r) = {1 - Sy^(pi{ri,r), (5.42) 

n>l 

where the operator S acts on the variable r. 
We are thus left with the result 

fRi,R.{ri,r) = -/i(ri)(^(ri,r) (5.43) 
uj 1 + r 

where the factor 1/(1 + r) comes from the last event £„. We now make the change of 
variable y = 1/r, yi = 1/ri, and define ipnivi, y) = '^nij'i^ and ip{yi, y) = (p{ri,r). The 
Laplace transform of ipi{yi,y) is 'ipiiyi, s) = yfe''^^^. Therefore, in analogy with (I5.17p . 
the Laplace transform of ^ip^yi, y) reads 

^l;{y^, s) = y\ ( e"^^^ + ^ e'^^^^ j dt e^^*)-*^^ j . (5.44) 

The expression (15.431) of the normalized joint probability density of the variables R\ 
and Ry therefore contains two non-trivial factors: = Qiiyi), whose Laplace 

transform with respect to yi is given by fl5.18p . and ip{ri,r) = ip{yi,y), whose Laplace 
transform with respect to y is given by fl5.44p . The knowledge of this distribution allows 
one, at least in principle, to compute the distribution of the time ratios po, Aq, Po and of 
similar quantities. Analytical expressions thus obtained are however very cumbersome, 
and therefore of little use, either theoretically or practically, except for some simple 
examples, such as the amplitudes A and B, which will now be evaluated explicitly. 

Consider first the amplitude A. The probability density fi/p can be expressed as 

fi/p{x) = dri dr fR^^n,{ri,r) 6\x 1 (5.45) 



r 



dr-— -^/R,KJ^,r . (5.46) 
[r + x)"^ \r + X ) 

This expression simplifies in the x — limit to yield 

^=/i/p(0)= rdr/^„^,(l,r)l = i /^\ . (5.47) 
JO r UJ \Rvl R^=\ 

We obtain similarly 

i3 = /i/.(0)= rdr/«,^,(l,r)^ = i /l + ^\ , (5.48) 



hence the relation 



UJ 

The explicit form of B is 



B = A+-. (5.49) 



B 



1 rl Ay 1 rco ^ 1 roo 

- —if{l,r) = - ds^{l,s) = - dte-*+^W, (5.50) 
UJ Jo r UJ Jo UJ Jo 
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where the double integral resulting from the insertion of the expression (I5.44p of s) 
has been simplified by an integration by parts. We thus obtain the numerical values 

^ = 2.127451..., i3 = 3.729168... (5.51) 

6. Connections to other fields 

6.1. Records and cycles of permutations 

As noted above, the asymptotic probability for a record node to be a leader is equal to 
the Golomb-Dickman constant, a number which appears in problems of combinatorical 
nature. It is for example the limit, when n —>■ oo, of (A„)/n, where A„ is the length 
of the longest cycle in a random permutation of order n [22l [23| [2l] . This number also 
appears in the framework of the decomposition of an integer into its prime factors |21j . 

Furthermore, as shown by Goncharov [23] and Shepp and Lloyd [21], A„/n TZ, 
where the distribution of the limiting random variable TZ coincides with the invariant 
distribution found in the present work. The identity of the asymptotic probability 
for a record node to be a leader and of the Golomb-Dickman constant is just the identity 
of {R) (see eq. ( lOTD ) and (7^). 

We now explain the origin of the coincidences between features of the statistics of 
records and the cycle structure of permutations. The existence of connections between 
the two fields is well known [3, [25j. In particular, M„, the number of records up to 
time n, has the same distribution as the number of cycles C„ in a random permutation of 
order n [7], as mentioned in section [2711 There is actually a deeper relationship between 
the sequence of record times on the one hand, and the cycles of a random permutation 
on the other hand, which is due to the fact that the latter can be generated by the same 
set of indicator variables as the former. 

For records, these variables are the Jj defined in section 12. 1[ For cycles of 
permutations the construction is due to Feller [13], as we now recall. A permutation of 
order n is constructed by a succession of n decisions. Let Oi, 02, be the n letters 

of the permutation. The position of ai is first chosen, with n possibilities: 1 —>■ i. Then 
the position of is chosen, with n — 1 possibilities: i ^ j, and so on, until a cycle is 
formed. For example [13], with n = 8, choosing 

1 ^ 3 ^ 4 ^ 1 

2^5^6^8^2 

7->7, (6.1) 

generates the permutation Define the indicator variable J^, equal 

to 1 if a cycle is formed at the A;— th step, else to 0. For this example we have J3 = 1, 
J7 = 1, Js = 1. Clearly, in general, 

= 1) = ^^3^^. (6.2) 

and the Jk are independent. Here the sequence of reads 00100011, with Prob( J3 = 
1) = 1/6, Prob(J7 = 1) = 1/2, Prob(J8 = 1) = 1. Reverting the sequence of steps. 
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we obtain 11000100 for the new indicator variable, that we denote by Jj say, such 
that Prob(/i = 1) = 1, Prob(/2 = 1) = 1/2, Prob(J6 = 1) = 1/6. One recognizes the 
construction of a sequence of records. 

6.2. Random breaking of an interval 

We have been, up to now, mainly interested in the statistics of leaders. We now address 
the full statistics of node degrees, within the continuum approach. Consider a fixed late 
record time N^- The degrees of the earlier records read Kj_i{Nm) = Nj — Nj_i = Aj 
for j = 2, . . . ,m. These degrees sum up to each of them representing a finite 

fluctuating fraction of the total, or "weight" Aj/Nm- We have 

— = 1-Um, 

%^ = t/^(l-f/„.-l), 

%^ = ?7^?7„_i(l-t/„_2), ... (6.3) 
Relabeling the indices j^k = m — j + and denoting the weights by Wk, we have 
W, = l-U,, 

W2 = Ui{l-U2), 

Ws = U,U2{l-Us), ... (6.4) 

We recognize the sequence of weights obtained by randomly breaking an interval of unit 
length into two pieces, and iterating the process [26] . 

A convenient tool to investigate this kind of fluctuating weights consists in 
introducing the reduced moments 

Yi'^ = E ( ^ ) = E W^r, (6.5) 

where the order p = 1,2,... is any integer [26]. These moments obey the random 
recursion 

Yii, = Ul^.Yi^^ + (1 - U^^.y. (6.6) 

The latter can be viewed as a generalization of the recursion fl3.17p to any finite integer 
order p. Eq. (13.171) is formally recovered in the p ^ oo limit, where the sum in (16.51) is 
dominated by its largest term, that is by the contribution of the leader. The YJ;P\ which 
keep fluctuating in the late-time regime, have non-trivial limit distributions, invariant 
under the dynamical system (16. 6p . A plot oiY^\ obtained by iterating (16.61) numerically, 
as well a plot of the invariant measure fn of the largest weight can be found in [26] . In 
the framework of the random breaking of an interval, u is either the probability that 
the first weight Wi be the largest, or the mean maximal weight. 

% The sum is actually equal to Nm ~ 1, but we neglect the correction in the continuum limit. 
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6.3. Relation to the Kesten variable 

It turns out that the invariant distribution of the random variable R can be worked 
out more generally for a one-parameter family of problems containing the above as a 
special case. Consider the recursion fl3.17p where the i.i.d. random variables Um+i have 
an arbitrary distribution between and 1, with density p{u). The invariant distribution 
still obeys the fixed-point equation (I4.4p . with the definitions 

.mm(l,x/(l-x)) /X\ 

Sf{x) = —P[-) /W' 

/■min(l,a;/ {l—x)) 

Lf{x)=p{l-x) dufiu). (6.7) 



(6.9) 



The integral equation thus obtained cannot be solved in closed form in general. It 
nevertheless leads to a differential equation similar to (14. 6p . and is therefore solvable, 

whenever the density of the variables Um+i is a power law. More precisely, if 

p{u) = bu''-\ (6.8) 

where b is an arbitrary positive parameter, one has 

i ( (^^-(^)) - (^ - + - 

^fO {l<v<2), 
\-bfviv-l) iv>2). 

Consider the modified Laplace transform 

fy{s) = {V' e~'^) = dvv'e-^- fv{v). (6.10) 
This quantity obeys the differential equation 

' = + " (6-11) 
whose normalized solution reads 

/^(s) = Y^^^-s-ms) = T{b + l)e'^e-'-'^^'\ (6.12) 
The probability u (see eq. f l4.26p ) generahzes to 

uj{b) = Jiin Prob(l - t/^+i > f/^+ii?™) = ((y^)') ' (6.13) 



^ib) = ({j^] ) = ^ I dss'-'e-^Ms), (6.14) 



that is 

,,,(h\ = I ( , , _ 

y + l) I T{b)Jo 

i.e., finally 

u{b) = / dse-^-''^^^). (6.15) 
Jo 

The original problem with a uniform distribution of the variables Um+i is recovered 
by setting 6 = 1 in the above results. One has indeed /v(s) = — d/v(s)/ds. 
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and an integration by parts shows that fl6.12p is equivalent to fl4.16p . Finally, the 
expression (16.151) for ci;(l) coincides with the expression (14.281) of the Golomb-Dickman 
constant. 

It can be checked that uj{b) = {R) for all values of b. This identity generalizes fl4.27p . 
However, linim^oo Prob(l — Um+i > Um+iRm) and {R) are not equal for an arbitrary 
distribution p{u). 

The above family of exactly solvable invariant densities is in correspondence with 
the following problem. Consider the Kesten variable [271 EH] , defined as 

Z = 1 + xi + X1X2 + X1X2X3 H , (6.16) 

where the Xffi 1.1. d. positive random variables with probability density p^^'^^\x). If 
this distribution is such that (Inx) < 0, the sum in (16.161) is convergent and Z has a 
well-defined probability density f^^^^\z), solution of the integral equation 

fr'\z) = r-p'^-\x) jf-'C-^) , (6.17) 

Jo X \ X J 

This equation again cannot be solved in closed form in general. It is however 
known [2Sl 121] that the problem can be solved whenever the density of the variables x^. 
is a power law on an interval [0, a]. In the marginal situation (a = 1) where the Xm are 
between and 1, with density 

p(Ke«)(a;) = (6.18) 

where h is again an arbitrary positive parameter, the Laplace transform fz^^^\s) = 
(e~^^) has the closed-form expression 

jf-^)^s) = e-'-'^^'\ (6.19) 

The similarity between the two problems is now patent by comparing (16.121) and (I6.19p . 
The probability densities of the variable V = 1/R and of the Kesten variable Z are 
related to each other by the equation 

x'Mx) = T{b + l)e'^ft'\x). (6.20) 



7. Conclusion 



The main goal of this work has been to put forward the Record-driven growth process. 
This ballistic growth model entirely based on the record process has been met as the 
zero-temperature limit of a class of network growth models with preferential attachment. 
Its simplicity and its minimality however suggest that the RD growth process might be 
relevant to a wider class of situations, besides the realm of complex networks. The 
main emphasis has been put on the interplay between records (i.e., nodes endowed with 
the best intrinsic qualities) and leaders (i.e., nodes whose degrees are the largest). The 
RD growth process provides a natural playground where subtle questions related to the 
statistics of leaders and of lead changes can be addressed in a quantitative way. 

The RD growth process inherits from the record process some relationships with 
combinatorical problems related to permutations. Relationships with fragmentation 
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models and with one- dimensional disordered systems have also been underlined. A key 
feature of the RD growth process is its temporal self-similarity in the late-time regime, 
inherited from the underlying record process, which manifests itself in that various time 
ratios have non-trivial limiting distributions in the regime of late times. This regime is 
also characterized by the very fast fall-off of temporal correlations. 

Let us point out the recent work [30], which addresses the statistics of records 
for the successive positions of a random walk. The mean longest duration of a record 
scales with the number of steps, the ratio defining a non-trivial constant 0.626 508 . . . 
This parallels the scaling of the mean maximal inter-record interval of the present work, 
resulting in the occurrence of the Golomb-Dickman constant 0.624 329 . . . 

To close up, it is worth looking back to our starting point, namely the growing 
networks with preferential attachment considered in the Introduction. The self-similar 
growth regime of the RD process can be shown to be unstable against thermal 
fluctuations. This regime crosses over to a complete freeze-out at a time scale which 
usually diverges at a power-law at low temperature, as r ~ T~''. The model-dependent 
exponent b can be evaluated by generalizing the line of thought of the recent work [31] , 
whether or not the model has a finite-temperature condensation transition. 
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